Mapping Parallel Programs into Hierarchical Distributed Computer Systems

Authors

  • Victor G. Khoroshevsky
  • Mikhail G. Kurnosov
Abstract

Most modern distributed computer systems (computer clusters and MPP systems) have a hierarchical organization and non-uniform communication channels between elementary machines (computer nodes, processors, or processor cores). The execution time of a parallel program depends significantly on how it is mapped onto the computer system, that is, on which elementary machines its parallel processes are assigned to and which channels are used for inter-process communication. The general problem of mapping a parallel program onto a distributed computer system is a well-known NP-hard problem, and several heuristics have been proposed to approximate its optimal solution. This paper proposes an algorithm, based on task graph partitioning, for mapping parallel programs onto hierarchical distributed computer systems. A software tool for mapping MPI applications onto multicore computer clusters is described, and the quality of the algorithm is evaluated with the NAS Parallel Benchmarks.
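To make the idea of mapping by task graph partitioning concrete, here is a minimal sketch in Python. It is not the authors' algorithm, which the abstract does not describe in detail; it is a simple greedy heuristic for a two-level hierarchy (nodes containing cores), and every name in it (`greedy_partition_mapping`, `inter_node_traffic`, the edge weights) is a hypothetical chosen for illustration. Processes that exchange the most data are grouped onto the same node so that less traffic crosses the slower inter-node channels.

```python
from collections import defaultdict
from itertools import combinations


def greedy_partition_mapping(num_procs, edges, cores_per_node):
    """Assign processes 0..num_procs-1 to nodes; returns node_of[p].

    edges: dict {(i, j): weight}, communication volume between processes.
    cores_per_node: list with the number of cores available on each node.
    Illustrative greedy heuristic only (an assumption, not the paper's method).
    """
    if sum(cores_per_node) < num_procs:
        raise ValueError("not enough cores for all processes")

    # Symmetric adjacency: comm[i][j] is the traffic between i and j.
    comm = defaultdict(lambda: defaultdict(float))
    for (i, j), w in edges.items():
        comm[i][j] += w
        comm[j][i] += w

    unassigned = set(range(num_procs))
    node_of = [None] * num_procs
    for node, capacity in enumerate(cores_per_node):
        if not unassigned:
            break
        if capacity <= 0:
            continue
        # Seed the group with the heaviest communicator (lowest rank on ties).
        seed = max(unassigned, key=lambda p: (sum(comm[p].values()), -p))
        group = [seed]
        unassigned.remove(seed)
        while len(group) < capacity and unassigned:
            # Add the process that exchanges the most data with the group.
            best = max(
                unassigned,
                key=lambda p: (sum(comm[p][q] for q in group), -p),
            )
            group.append(best)
            unassigned.remove(best)
        for p in group:
            node_of[p] = node
    return node_of


def inter_node_traffic(edges, node_of):
    """Total weight of task graph edges whose endpoints sit on different nodes."""
    return sum(w for (i, j), w in edges.items() if node_of[i] != node_of[j])


# Hypothetical task graph: even ranks talk heavily to each other, odd ranks
# likewise, and each pair (2k, 2k+1) exchanges a small amount of data.
edges = {}
for clique in ([0, 2, 4, 6], [1, 3, 5, 7]):
    for i, j in combinations(clique, 2):
        edges[(i, j)] = 10
for k in range(0, 8, 2):
    edges[(k, k + 1)] = 1

mapping = greedy_partition_mapping(8, edges, cores_per_node=[4, 4])
block_mapping = [0, 0, 0, 0, 1, 1, 1, 1]  # ranks placed in order, node by node
print(mapping, inter_node_traffic(edges, mapping))
print(block_mapping, inter_node_traffic(edges, block_mapping))
```

On this toy graph the greedy grouping keeps each heavily communicating clique on one node, while the in-order block placement splits both cliques across nodes. A deeper hierarchy (for example sockets inside nodes) could be handled by applying the same grouping step recursively within each group.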

Similar resources

ANDES: Evaluating mapping strategies with synthetic programs

This paper presents the ANDES performance evaluation tool. ANDES is based on the synthetic execution of parallel programs and it is used for the evaluation of mapping strategies. The Meganode, a distributed memory parallel computer, is considered as our target architecture. ANDES takes into account a benchmark of quantitative models of parallel algorithms and a set of mapping strategies (greedy...

Mapping Large Parallel Simulation Programs to Multicomputer Systems

We consider the problem of mapping parallel simulation programs to distributed memory parallel machines. Since a large fraction of computer simulations consists of solving partial differential equations, the communication patterns of the resulting parallel programs can be exploited to construct efficient mappings which lead to low communication overhead. We report about the application of Kohon...

Static Task Allocation in Distributed Systems Using Parallel Genetic Algorithm

Over the past two decades, PC speeds have increased from a few instructions per second to several million instructions per second. The tremendous speed of today's networks as well as the increasing need for high-performance systems has made researchers interested in parallel and distributed computing. The rapid growth of distributed systems has led to a variety of problems. Task allocation is a...

Task Mapping on Distributed Shared Memory Systems Using Hopfield Neural Network

In order to reduce the execution time of a parallel program, the tasks/threads of the program have to be carefully mapped onto the processors of a system. Most mapping methods used on current Multithreaded Distributed Shared Memory (DSM) systems only consider the workload balance. Due to the ignorance of the communication between tasks/threads, these methods may lead to such mappings have exces...

Trasgo 2.0: Code generation for parallel distributed- and shared-memory hierarchical systems

Current multicomputers are typically built as interconnected clusters of shared-memory multicore computers. A common programming approach for these clusters is to simply use a message-passing paradigm, launching as many processes as cores available. Nevertheless, to better exploit the scalability of these clusters and highly-parallel multicore systems, it is needed to effici...

Publication date: 2009